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Description 

MULTIPLE STEP ADAPTIVE METHOD FOR 

TIME SCALING 

Background of Invention 
[000 1 ] 1. Field of the Invention 

[0002] The present invention relates to a signal-synthesizing 

method, and more particularly, to a multiple step adaptive 
method for time-scaling. 

[0003] 2. Description of the Prior Art 

[0004] d ug t 0 the dramatic progress in electronic technologies, 
an AV player such as a Karaoke can provide more and 
more amazing functions, such as audio clean-up, dynamic 
repositioning of enhanced audio and music (DREAM), and 
time scaling. Time scaling (also called time stretching, 
time compression/expansion, or time correction) is a 
function to elongate or shorten an audio signal while 
keeping the pitch of the audio signal approximately un- 
changed. In short, time scaling only adjusts the tempo of 



an audio signal. 

[0005] | n general, an AV player performs time scaling with one of 
three following methods: Phase Vocoder, Minimum Per- 
ceived Loss Time Expansion/Compression (MPEX), and 
Time Domain Harmonic Scaling (TDHS). Phase Vocoder 
transforms an audio signal into a complex Fourier repre- 
sentation signal with Short Time Fourier Transform (STFT) 
and further transforms the complex Fourier representa- 
tion signal back to a time scaled audio signal correspond- 
ing to the original audio signal with interpolation tech- 
niques and iSTFT (inverse STFT). MPEX is a method re- 
searched and developed by Prosoniq for simulating char- 
acteristics of human hearing, similar to artificial neural 
network. MPEX records audio signals received for a prede- 
termined period and tries to"learn" the audio signals, so 
as to either elongate or shorten the audio signals. TDHS is 
one of the most popular methods for time scaling. TDHS 
first establishes an autocorrelogram of a first audio signal, 
the autocorrelogram consisting of a plurality of magni- 
tudes, and then delays the first audio signal by a maxi- 
mum index corresponding to a maximum magnitude, a 
largest magnitude among all of the magnitudes of the au- 
tocorrelogram, to form a second audio signal, and lastly 



synchronizes and overlap-adds (SOLA) the first audio sig- 
nal to the second audio signal to form a third audio signal 
longer than the first audio signal. 
[0006] please refer to Fig.l, which is an autocorrelogram 10 for 
TDHS according to the prior art, the autocorrelogram 10 
consisting of a plurality of magnitudes. In general, besides 
a maximum magnitude 12 and magnitudes there away, 
remaining magnitudes in the autocorrelogram 10 has a 
small value. In addition, two neighboring magnitudes of 
the autocorrelogram 10 differ slightly. For example, if a 
first magnitude 14 is far smaller than the maximum mag- 
nitude 12, a second magnitude 16 neighboring the first 
magnitude 14 is also far smaller than the maximum mag- 
nitude 12. On the contrary, if a third magnitude 18 differs 
slightly from the maximum magnitude 12, a fourth mag- 
nitude 20 neighboring the third magnitude 18 is probably 
very close to the maximum magnitude 12 and accordingly 
a fourth index 




(corresponding to the third 18 or fourth magnitude 20 as 
shown in Fig.l) is also probably very close to a maximum 
index 



man 



corresponding to the maximum magnitude 12. 
[0007] | n a computer system, the autocorrelogram 10 is usually 
established by a digital signal processing (DSP) chip de- 
signed to manage complex mathematic calculation such 
as convolution and fast Fourier transform (FFT). However, 
a process to determine the maximum magnitude 12 and 
the corresponding maximum index 



man 



by establishing the autocorrelogram 10 with a DSP chip is 

tedious and sometimes unnecessary. 
Summary of Invention 

[0008] it is therefore a primary objective of the claimed invention 
to provide a multiple level adaptive method for time scal- 
ing capable of determining a maximum index corre- 
sponding to S [n] and S [n] signals efficiently and synthe- 
sizing an S [n] signalfrom the S [n] and S [n] signals. 

[0009] According to the claimed invention, the method comprises 
following steps:(a) calculating a first magnitude of a 
cross-correlation function of the S [n] signal and the S [n] 



signal according to a first index; (b) comparing the first 
magnitude with a threshold value; (c) if the first magni- 
tude is smaller than the threshold value, calculating a first 
reference magnitude of the cross-correlation function of 
the S [n] signal and the S [n] signal according to a first 
reference index behind the first index by a first deter- 
mined number, or calculating a second reference magni- 
tude of the cross-correlation function of the S [n] signal 
and the S [n] signal according to a second reference index 
behind the first index by a second number; and (d) syn- 
thesizing the S [n] signal by adding the S [n] signal to the 

3 1 

S [n] signal in accordance with a maximum index corre- 
sponding to the largest magnitude among all of the mag- 
nitudes calculated in step (c). 

[0010] | n the preferred embodiment of the present invention, the 
first predetermined number is larger than one, while the 
second predetermined number is equal to one. 

[° 01 1 ] It is an advantage of the claimed invention that a DSP chip 
does not have to calculate all of the magnitudes in an au- 
tocorrelogram, thus saving time to establish the autocor- 
relogram and promoting the efficiency of a computer 
where the DSP chip is installed in. 

[0012] These and other objectives of the claimed invention will 



no doubt become obvious to those of ordinary skill in the 

art after reading the following detailed description of the 

preferred embodiment that is illustrated in the various 

figures and drawings. 
Brief Description of Drawings 

[0013] pig.l is an autocorrelogram for TDHS according to the 

prior art. 

[0014] pig. 2 is an autocorrelogram corresponding to a method 

according to the present invention. 
[0015] Fig. 3 is a flow chart demonstrating a method according to 

the present invention. 
[0016] Fig. 4 is a schematic diagram demonstrating how the 

method synthesizes an S [n] signal from an S [n] signal 

1 

and an S [n] signal according to the present invention. 
[0017] Fig. 5 is a schematic diagram demonstrating how the 

method elongates an audio signal according to the 

present invention. 
[0018] Fig. 6 is a schematic diagram demonstrating how the 

method shortens an audio signal according to the present 

invention. 
Detailed Description 



[0019] | n a process of establishing an autocorrelogram of a first 



audio signal and a second audio signal, a method 100 of 
the preferred embodiment of the present invention com- 
pares a magnitude corresponding to an index in the auto- 
correlogram with either a first threshold th or a second 
threshold th , the first threshold th smaller than the sec- 

2 1 

ond threshold th , and calculates magnitudes correspond- 
ing to indexes following the index in the autocorrelogram. 
In detail, if a first magnitude 



R(ii) 



in the autocorrelogram is smaller than the first threshold 
th , indicating a first index corresponding to the first 
magnitude 



RUi) 



is still far from a maximum magnitude 



R ( T max) 



corresponding to a maximum index 



man 



, the method 100 calculates a second magnitude 



R ( T 2 ) 



corresponding to a second index 



T 2 



lagging the first index 



T i 



by a first predetermined number A ; If a third magnitude 



RU 3 ) 



in the autocorrelogram is larger than the first threshold th 
but still smaller than the second threshold th . indicating 
a third index 




corresponding to the third magnitude 



R ( T 3 ) 



is closer to the maximum index 



man 



than the first index 



T i 



, the method 100 calculates a fourth magnitude 



R(r 4 ) 



corresponding to a fourth index 




lagging the third index 




by a second predetermined numberA , the second prede- 
termined numberA 2 smaller than the first predetermined 
numberA^ If a fifth magnitude 



R(Ts) 



in the autocorrelogram is larger than the second threshold 
th 2 , indicating a fifth index 



T 5 



corresponding to the fifth magnitude 



R(Ts) 



is quite close to the maximum index 



man 



, the method 100 calculates a sixth magnitude 



R( T e) 



corresponding to a sixth index 




right after the fifth index 



T 5 



[0020] please refer to Fig. 2 and Fig. 3. Fig. 2 is an autocorrelogram 

30 corresponding to the method 100 according to the 
present invention. Fig. 3 is a flow chart demonstrating the 
method 100 according to the present invention. The 
method 100 comprises following steps: 

[0021] step 102:Start;(An S [n] signal is to be synthesized from 
an S [n] signal and an S [n] signal. For simplicity, the S [n] 
signal and S [n] signals are both defined to contain N sig- 
nals. Of course, the numbers of signals the S [n] signal 
and S [n] signal contain can be different.) 

[0022] step 103:Delaying the S [n] signal by a predetermined 



number A and forming an S [n] signal;(ln order to prevent 
run-in from occurring in a process a pickup of an A/V 
player reads the S [n] signal, the method 100 delays the S 
[n] signal by the predetermined number A and then de- 
termines the maximum index 

^ max 



crucial for the process to synthesize the S [n] signal from 
the S [n] signal and the S [n] signal. In the preferred em- 
bodiment, the predetermined number A is equal to [N/3].) 
[0023] step 104:Calculating an initial magnitude R(l) corre- 
sponding to an initial index 



Z i( Z =D 



corresponding tothe S [n] signal and the S [n] signal, set- 
ting a determinant magnitude R to be the initial magni- 

c 

tude R(l), and setting a determinant index 




corresponding to the determinant magnitude R to be the 

c 

initial index 



T i 



;(The initial magnitude R(l) is equal to 



f; s, [«] * s 2 [» + 1] 

n=0 



.) 

[0024] step 106:lf 



( r c = N-l) 



, then go to step 200, else go to step 108;( 




equal to N-l, indicates the determinant magnitude R is 
the last magnitude in the autocorrelogram 30. The auto- 
correlogram 30 is completely established.) 
[0025] step 108:Comparing the determinant magnitude R with 

c 

either the first threshold th or second threshold th . If 

1 2 

the determinant magnitude R^ is smaller than the first 
threshold th (as the R(l) shown in Fig. 2), then go to step 
110; If the determinant magnitude R falls on a region be- 

c 

tween the first threshold th and the second threshold th , 

1 2 

then go to step 140; If the determinant magnitude R is 

c 

larger than the second threshold th , then go to step 
170;(lf the determinant magnitude R is larger than the 

c 



second threshold th , indicating the determinant index 

I r- 



corresponding to the determinant magnitude R is located 

c 

on a region nearby the maximum index 



man 



, then the method 100 calculates magnitudes corres 
ing to indexes right after the determinant index 




(as a magnitude R( 



R( T i) 



corresponding to an index 




shown in Fig. 2), or the method 100 neglects the calcula- 
tion of magnitudes corresponding to indexes following 
the determinant index 




and calculates magnitudes corresponding to indexes lag- 
ging the determinant index 




by the first predetermined numberA^r second predeter- 
mined numberA 2 directly to save the time for a DSP chip 
to calculate magnitudes in the autocorrelogram 30. Please 
note that, in order to find out the maximum index 



man 



corresponding to the maximum magnitude R exactly, 

max 

the first threshold th and second threshold th can not 

1 2 

be defined to have too large values in the beginning to 
calculate the maximum index 



man 



according to the method 100. For example, if the second 
threshold th 2 is set to be a third threshold th initially, af- 
ter calculating the 



R( T j) 



, the method 100, according to the decision performed in 
the step 108, calculates a magnitude 



R( x j + A 2 ) 



instead of calculating a magnitude 



R ( T j + 1) 



and in the end does not calculate the exact magnitude 



R ( 7" max) 



, but obtains a magnitude 



R ( T man) 



instead, a wrong index 



z 



max 



corresponding to the magnitude 



R ( T man) 



is therefore used to synthesize the S 3 [n] signal from the S 
[n] and S [n] signals.) 

1 5 

[0026] step 110:Setting magnitudes 



R(k i c <k<i c +Ai f if k<N) 



to be zero and the determinant index 




to be( 




+A1) and calculating the determinant magnitude 



R ( T c) 



corresponding to the determinant index 




of the SJn] and S [n] signals; go to step 106;(The deter- 
minant magnitude 



R ( T c) 



is equal to 




) 

[0027] st e p i40:Setting magnitudes 



R(k i c <Kr c +A 2 , if k<N) 



to be zero and the determinant index 




to be( 




+A2) and calculating the determinant magnitude 



R ( T c) 



corresponding to the determinant index 




of the S [n] and S [n] signals; go to step 106; 

1 5 

[0028] step 170:Setting the determinant index 



(Tc + 1) 



and calculating the determinant magnitude 



R ( T c) 



corresponding to the determinant index 




of the S [n] and S [n] signals; go to step 106; 

1 5 

[0029] st e p 200:Determining the maximum index 



man 



corresponding to the maximum magnitude R in the 

max 

autocorrelogram 30; 
[0030] step 202:Delaying the S [n] signal by the maximum index 



man 



and forming an SJn] signal; 
[0031] step 204:Weighing the SJn] signal and adding to the S 4 [n] 
signal and forming the S [n] signal;(The S [n] signal = S 
[n] signal, where 0 <= n < ([N/3] + 



man 



); = (N-n)/(N-([N/3]+ 



man 



))*S,[n] + (n-([N/3]+ ))/(N-([N/3]+ 

1 max 



man 



))*S Jn-([N/3]+ 

4 



man 



)], where ([N/3]+ 



man 



) <= n < N; = S >-([N/3]+ 

4 



man 



)], where N <= n <= (N+[N/3]+ 



man 



)) 

[0032] step 300:Updating the first threshold th and second 
threshold th based on the maximum magnitude R ; 

2 max 

and(Since the S [n] and S [n] signals are both derived from 
an S[n] derived from an original signal S (an audio or 

org 

video signal), any sampling signals in the S[n] following 
the S [n] and S [n] signals, such as an S [n] signal and an 

12 6 

S [n] signal, have certain characteristics similar to those of 
the S [n] and S [n] signals. Therefore, the maximum mag- 
nitude R calculated in step 200 can be used to be an 

max 

updating reference to update the first threshold th and 
the second threshold th needed for the synthesizing of 



the S [n] and S [n] signals, omitting the necessity to set 

6 7 

too small the first threshold th and second threshold th 

1 2 

from calculating the wrong maximum index 



z r 



max 



, too small the first threshold th and second threshold th 

1 2 

increasing the burden for the DSP chip to calculate unnec- 
essary magnitudes.) 
[0033] Step 302:End. 

[0034] please refer to Fig. 4, which is a schematic diagram 

demonstrating how the method synthesizes the S [n] sig- 
nal from the S [n] and S [n] signals according to the 
present invention. In Fig. 4, a first part 400 shows the S [n] 



and S [n] signals in the step 102 of the method 100, a 
second part 402 shows the maximum index 

t max 



and the SJn] signal calculated from the step 103 to step 
202 of the method 100, and a third part 404 shows the S 
[n] signal synthesized from the S [n] and S [n] signals in 
the step 204 of the method 100. 
[0035] | n the preferred embodiment of the present invention, the 

magnitudes 



R(k T <k< i+A r2f if k<N) 



calculated in the steps 110 and 114 of the method 100 
are all set to be zero. However, these magnitudes can be 
set to be any values, equal or different from each other, as 
long as these values are all smaller, preferably far smaller, 
than the maximum magnitude R 

max 

[0036] if the S [n] signal is the same as the S [n] signal and both 
are derived from the S[n] at an identical region, as shown 
in Fig. 5, the method 100 in fact elongates the S [n]. On 
the contrary, if the S [n] signal and the S [n] signals are 
different from each other and are derived from the S[n] at 
two distinct regions respectively, as shown in Fig. 6, the 

method 100 in fact combines and shortens the S [n], an S 

l 



[n] (discarded) and the S [n] signals into the S [n] signal. 
[0037] | n contrast to the prior art, the method of the present in- 
vention compares a temporary magnitude (R ) in an auto- 

c 

correlogram with a threshold (th or th ) and calculates 
magnitudes corresponding to indexes lagging a tempo- 
rary index corresponding to the temporary magnitude by 
a predetermined number without calculating all magni- 
tudes in the autocorrelogram, saving time for a DSP chip 
to calculate the maximum index 



max 



and therefore promoting the efficiency of a computer 
where the DSP chip is installed in accordingly. In the pre- 
ferred embodiment of the present invention, the first pre- 



determined number is 24 while the second predetermined 

number is 6, the first threshold th and the second 

1 

thresholds th can be set to be R /2 and R /4 respec- 

2 max max 

tively, that is numbers truncating the maximum magni- 
tude R by one and two bits respectively, and count of 

max 

the calculation can be reduced to ten percent without im- 
pacting quality of the S [n] signal. 
[0038] Following the detailed description of the present invention 
above, those skilled in the art will readily observe that nu- 
merous modifications and alterations of the device may be 
made while retaining the teachings of the invention. Ac- 
cordingly, the above disclosure should be construed as 
limited only by the metes and bounds of the appended 
claims. 



